首页> 外文OA文献 >Compression ratios based on the Universal Similarity Metric still yield protein distances far from CATH distances
【2h】

Compression ratios based on the Universal Similarity Metric still yield protein distances far from CATH distances

机译:基于通用相似度量的压缩率仍然有效   蛋白质距离远离CaTH距离

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

Kolmogorov complexity has inspired several alignment-free distance measures,based on the comparison of lengths of compressions, which have been appliedsuccessfully in many areas. One of these measures, the so-called UniversalSimilarity Metric (USM), has been used by Krasnogor and Pelta to compare simpleprotein contact maps, showing that it yielded good clustering on four smalldatasets. We report an extensive test of this metric using a much larger andrepresentative protein dataset: the domain dataset used by Sierk and Pearson toevaluate seven protein structure comparison methods and two protein sequencecomparison methods. One result is that Krasnogor-Pelta method has less domaindiscriminant power than any one of the methods considered by Sierk and Pearsonwhen using these simple contact maps. In another test, we found that the USMbased distance has low agreement with the CATH tree structure for the samebenchmark of Sierk and Pearson. In any case, its agreement is lower than theone of a standard sequential alignment method, SSEARCH. Finally, we manuallyfound lots of small subsets of the database that are better clustered usingSSEARCH than USM, to confirm that Krasnogor-Pelta's conclusions were based ondatasets that were too small.
机译:基于压缩长度的比较,Kolmogorov的复杂性启发了几种无对准距离测量方法,这些方法已成功应用于许多领域。这些措施之一,即所谓的通用相似度(USM),已由Krasnogor和Pelta用于比较简单的蛋白质接触图,表明它在四个小型数据集上产生了良好的聚类。我们报告了使用更大,更具有代表性的蛋白质数据集对该指标进行的广泛测试:Sierk和Pearson使用的域数据集评估了7种蛋白质结构比较方法和2种蛋白质序列比较方法。一个结果是,与使用这些简单联系图的Sierk和Pearson所考虑的任何一种方法相比,Krasnogor-Pelta方法具有更少的域区分能力。在另一项测试中,我们发现基于Sierk和Pearson的相同基准,基于USM的距离与CATH树结构的一致性较低。无论如何,其协议都低于标准顺序比对方法SSEARCH的协议。最后,我们手动找到了数据库的许多小子集,这些小子集使用SSEARCH比USM更好地聚类,以确认Krasnogor-Pelta的结论是基于太小的数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号